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ABSTRACT 

Discussion of higher education policy has been 
haapered in the past by an inability to forecast the effects on 
student behavior of proposed policies. This report describes efforts 
to develop a theoretical and empirical aodel of student behavior that 
will help to establish a aethod of forecasting enrollaent patterns. 
In this aodel, actual college enrollnents are the result of decisions 
aade both by college adainistrators and by prospective students. The 
adainistrators deteraine feasible alternatives for students, «ho then 
select the "best" college. If this b*jst alternative is aore 
attractive than the various possibilities other than college, 
enrollaent follows. In this report, the student's decision problea is 
separated into successive stages. Foraally, at each stage of the 
student Is decision problea, he or she aaziaizes a utility function 
defined over the relevant alternatives. The results show that a 
utility-aaxiaizing view of student behavior offers a useful 
perspective on enrollments. Results also indicated that price— both 
tuition and rooa and board charges— had a lower effect on the 
decisions of students froa higher-incoae faailies. Also the effect of 
parental education on college-going behavior declines with increasing 
faaily incoae. After presenting the estimation results, the potential 
use of the aodel for forecasting enrollaent patterns is illustrated. 
(Author/PG) 
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Discuss ion of higher education policy has been hampered in the pddt 
by «in in.'ihnity to forecant the effects on student behavior of proposed 
policies. This report describes our efforts to develop a theoretical 
and empirical model of student behavior that will help to establish a 
method ot forecasting^ enrollment patterns. 

in our model t actual college enrollments a/' tie result of deci- 
sions made hoih by college administrators and by ^pective students* 
The .idministrators determine a set of feasible alternatives for the 
students, who then select a **best** college. If tMs best alternative 
is more attractive than the vaiii. <t possibilities other than college^ 
enrollment follows. The student's evaluation of a given college will 
di*pend on his perception of its benefits and costs. 

In this report « the student's decision problem is separated into 
three successive stages: (1) for each available college, the choice of 
whether to commute or to live on campus, should that college ultimately 
be chosen; (2) the choice of the %cst" LOllege available, given the 
residency decision; ( i) the choice of whether to enroll at this "best" 
college or not at all. The residency choice is determined by the dis-^ 
t.ince from home to college, family income, and other variables. The 
choice among colleges Is affected by several institutional and student 
v;iri«ihles. Institutional variables include tuition, room and board 
cturKcs, average student ability, field breadth, and per student rev- 
enues. Student attributes include family income, ability, and dis- 
tance from home to college. The decision of whether to enroll or not 
is determined by parental education, student sex, family income, and 
the attractiveness of the "best" college alternatives. 

Formally, at each stage of the student's decision problem, we 
'issume that he maximi7«es a utility function defined over the relevant 
alternatives. We use McFadden^s conditional logit maximum likelihood 
procedure to estimate the parameters of these utility functions. Prior 
to estim«ition, several data*-rclated problems were resolved. Foremost 
among these was the necessity to Impute the composition of the set of 
available colU^ges from which the student chooses. 
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Our results show that a utility-maximizing view of student be- 
havior oft\'rs a usivtu) perspective on enrollmentK. Our results regard- 
InR the Impait of priie and academic quality on student decisions are 
ol particular interest. For example, we found that price — both tuition 
and room and board charges— had a lower effect on the decisions of 
studonts from higher- income families. We also found that the effect of 
parcniil education on col legt. -going behavior declines with increasing 
family income. 

After presenting our estimation results, we illustrate the poten- 
tial use of the model for forecasting enrollment patterns. 
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U JNTRODUCTION 



niscussion of higher education policy has been hampered in Che past 
by an inability to predict with much confidence the effects on student 
behavior of propoHvd policies. How do federal and state programo of 
Institutional or student support affect a prospective student's deci- 
sion of whether to enroll and where to enroll? How will the location 
of new colleges affect these decisions? What might the impact be of 
proposed tuition increases In public institutions or the expected clos- 
ing of particular coPe^es and universities? This report describes our 
efforts to develop a theoretical and rapirical model of student behav- 
ior that will help to answer these and similar questions. 

In our model, actual college enrollments are the result of deci- 
Hions made both by college administrators and by prospective students. 
Tfie administrators, through offers of admission and of financial aid, 
determine a set of feasible alternatives for the students, who then 
select a "best" college. Enrollment follows if this best alternative 
is more attractive than the various possibilities other than college, 
such as technical education, the armed services, or immediate employ- 
ment. Colleges offer admission on the basis of relative academic merit 
among those that apply and financial aid on the basis of relative need 
among those admitted. The student's evaluation of a given college will 
be based on what he perceives to be Its benefits and costs. Our specifi- 
cation recognises that college Ih both an invehtment and a consumer good 
and that the taste for college may vary with individual background. Ue 
estimate a model of rollege behavior, and use this to impute the set of 
feasible college alternatives for each student. The model of college 
choice Ih then estimated by taking the actual alternative chosen as the 
one preferred to all others in the feasible set. Finally, we estimate 
a model of coUege-goIng behavior in which the student compares the 

* " ' 

Other tactors, such as regional distribution of sti.idents and 
sttni.nt hettrt>i;encity, .ils.i »'nter th«« college admission and financial 
processes. In tuir tno«lel, however, wr ignore these decision criteria 
for reaK .i.s of simpl i Meat ion. 



**ht*Nt** college available with the alternative of not going to college 
at all. (itir results. In general t corroborate existing belief » about 
the selet liuti ot Htudents by college adminiHtratorH and the selection 
ot ml leges by students* More important^ they bring us closer to our 
ultimate goal ot being able to forecast the effect of proposed federal 
and stare pnlii ies on the number and composition of enrollments and on 
the distribution ot students among institutions. 



II. PRtyipuS RESEAWIH ON THE DEMAND FOR_HJG^^^^^ BDUC^^ 



Thf tMfly empirical rc^Hf.irrh In ihis area included ^ nuinber of 

ft 

lonKiliuidial HtudieK of high Hchool Ktud^ntH. The»e studies confirmed 
that the important factorn inflia»nring enrollment are family incomet 
p;iriMU.ii educat ion« hinh Hchoi>l peer relat ionsships (and tracking) , and 
the proximity of a ciillege to the student home. Unf ortunately, these 
studies had nothing to say on the absolute or relative maRnitude of the 
various i«ftin*ts. For instance^ I,, L. Hedsker and J, W. Trent- found that 
a low-coHt« nearby college was an important stimulator of enrollment^ 
but thfv did not define or quantify "low-cost "nearby, or "important." 

i^itcr studies by economists*^ focused mainly on the effect of in- 
roroi* and price; Stephen Hoenack, however, did consider the effects of 
admission policy, Ttie major defioiencies of all of these studies were 
the hi«h level of a^iiregat f on, with resulting misspecif icat ion of cru- 
« iai variables, and the lack of attention given to admission policy. 



Sec I., L, Medsker and J. W. Trent, lUujond Hujh 3c*h(H»l: A Studij 
./ li\.u^i^ H'.fh .-.•^2.' / :riiiiuaUu^^ Center for Research and Development 
in Higher Education, University of California, Berkeley, 1967; J. 
Flanagan, et aK, '!%' Am^'ViCiifi Hifjh !lfhh)l lHwii*nt^ Pinal Report of 
Coop Research Project No, 635, U.S. Office of Education, University of 
TittHburgh, 1964; and U.S. Bureau of the Census, Fac^tom iu*loUfd to 
.'i'jh •Ih*.^ :9»tuh^if ^nni i\i!h\]*' Alfefuiant'r: UW? ^ Series P-20t 
No. |H'>, July II, 1969. 

^See R. Campbell and B. Siegal, "The Demand for Higher Education 
in the United States, 1919-1964," Amrpfcrj}} E^^o}umu^ Vol. 57, 

No. i, June 1967, pp. 482-^494; H. C;alper and R. M. Dunn, Jr., "A Short- 
Run Demand Function for Higher Education in the United States," f/r)24r*na/ 
•/ /. Vol. 77, No. 5, September-October 1969, pp. 765- 

777; F. Feldman and S. Hoenack, "Private Demand for Higher Education in 
the United States," I'h /;*v»w^im/'f;? ami HfPUi>unKi /•/ Uijhcv Eiiu<fdtti>n in 
^h*' y ../i^/.j:. The Joint Economic Committee, 1969; A. Corazaini, 

1). J. Diigan, and H. Crabowski, "Determinants and Distributional Aspects: 
ot Enrol Imetit in the U.S. Higher Education," •^^^unvuf ^^J' Hf4r*kw /w iV>?ir(V^?, 
Vol. 7, Ni». I, Winter 1972, pp. 19-^0; and C. W. BarneH, E. W. Erickson, 
W. Hill, Jr., and H. S. Uinokur, Jr., "Direct Aid to Students: A *Radi» 
cil* Stnii tural Reform," Inner City Fund, June 1972. 
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Hu» mo.^l ambitious stiidy to d.Jtc has been done by R. Radnor and 
I. Miller. Ihi Kener.il approaiii and spei i t ii at ion ot their study 
ire -imiLu to mit own^ .ind .their report lias ^Iven important direc t ion 
to .Mil wiitk. lUit sfv.'ral uala Wi-aknesses and assumptions limit the 
putititi il Mtilitv nt their results. Firsts although Hadner and Miller 
used the s.ime student data source as We usetl (the SCOPH sample)* they 
had .11 i fss to oiilv a small subset of the available data. Secondt the 
institutional alternatives were specified as broad categories of in- 
stitutions (public lour-year college, public two-year » etc.) rather 
than as the individual schools themselves. The resultant averaging 
of inst itut iimal variables may have weakened their analysis. Thirdt 
their treatment of tumenroi Iment — assigning this option a price of 
zero and a Scholastic Aptitude Test (SAT) score of 344.4 (i.e.* the 
average of nonenrol lees ) — is not very appealing* given the implicit 
assumption that *^ionenrol Iment" Is simply an institutional alternative 
very similar to a low-cost* low-quality school. Fourth, Radner and 
Miller assumed that within a given distance all students were com- 
muters. Fifth* the only variables that entered into their atialysis 
were student ability and income* institutional price* and average 
student ability. This prevented them from addressing the effect of 
family background — e.g.* parental education— on college-going behavior. 
Overcoming the constraints imposed by each of the weaknesses we per- 
ceive in the Radner and Miller effort has been an important source of 
direct io'i in our own effort. 

^Reported in R. Radner and L. S. Miller* **Demand and Supply in 
U.S. Higher Education: A Progress Report*** ArneHaan Eaonor^lo iieviftw — 
Ptif\r\i .znd rpf^^^rrdiK'jr^^ Vol. 60* No. 2* May 1970; and L. S. Miller* 
/'♦r?.z>zci f'>r Hijhi^^r* Eduifatinn in the United State^^ Working Paper No. 34* 
Economic Research Bureau* iitate University of New York* May 1971. 
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I I 1 . A MODEL OF COLLEGE CHOICE 



Thf illoi.ition «>f students to colleges is the result of a process 
i>t mutual selei-tion. This process is influenced by the actions of 
Rovernmt-nt .it various levels as well as by the secondary school systems 
that prepare* the prospective student and provide him with information 
and guidance in <*hoosing a college* Ue have not attempted to model 
this very complex system in its entirety; rather, we have concentrated 
on the hehaviiir of the individual student himself. The behavior of 
colleges was treated iinly insofar as it was needed to provide inputs 
lor our model iif stud(*nt decisionmaking* 

Ci^ru cptually, the student's decisionmaking problem may be broken 
di>wn intii three successive stages: (1) for each available college* 
tiic choiie of whether to commute or to live on campus* should that 
vol lege finally be ihosen; (2) the choice of the best college avail-- 
able, given the residency decision; (i) the choice of whether to en- 
roll at this best college or not at all* A college is available to 
the student it he would he admitted to th;it college were he to apply. 
The etfcctivt* cost of attending an institution is determined by the 
institution itself, by government* and by private groups through the 
setting o! tuition and living costs and through the distribution of 
tiiKUu i il aid. Students vary in ability, location, income, and family 
hack.^^ri'tind si> that the colleges available, the costs aad benefits of 
a givt»n 4-ollt»gt», an<i the alternatives to going to college will be dlf- 
terent ft^r eacli student* 

The limitations of existing theory, particularly with respect to 
ciM lege-go Ing behavior* and the need for empirical tractability force 
us ii^ aili^pl a numhir at simplifying assumptions* The mutual selection 
proi css o\ v iil leges and students is assumed to he recursive rather than 
simultaneous: all colleges make offers of admission and financial aid 
in advance of the student's decision, and these offers remain in effect 
throughout the decisionmaking period* The student then selects a 
college, and this selection is binding: no recontract ing is allowed 
and* typically, the market does not clear* We assume that the student 
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tias portiHt information about colleges. Ue do not consider part*^time 
enrollment or simultaneous enrollment at wt'^e than one institution. 

Clearly, the student will not actively consider all possible 
alternatives^ and he xaay actually apply to very few colleROs. There 
is an important element of self-selection in this process: the student 
will not .ipply to those colleges that lie considers inferior^ too expen- 
sive^ or unlikely to admit him. In our models we include in the feas- 
ibli» set all those colleges to which the student jhl have applied 
and tt» which he would have been admitted; the process of self-selection 
and active selection are treated as one. 

The structure ot our behavioral model is really quite simple. 
iUvcn a teasible set ot college alternatives^ the student ranks these 
in order ol preiereme and compares the best of them with the alter- 
native of not entering college at all. Enrollment follows if the best 
college alternative is preferred to not entering college. In the 
following two subsections, we present the formal structure of the model 
and discuss its estimation. 

THE FORMAL STRlltXURE 

Let A J be the set of college-residency alternatives available to 
a given student, i; and let be his set of alternatives other than 
college. His choice set Cj is defined as the union of A^ and B^. We 
assume that choice is rational, s« that if c is the alternative ac- 
tually chosen from Cj, then there is no c* in strictly preferred 
to c. 

Each student is described by a vector of characteristics Z^, each 
college alternative by a vector of characteristics X^, and each other 
alternative by a vector of characteristics Yj^. We assume that the 
student's behavior is consistent with his possessing a continuous 

it 

stochastic utility function over the elements of his choice set. 

*The stochastic nature of the utility function might be attributed 
to omitted variables or variations in taste. For a fuller discussion, 
see Charles F. Manskl, "Analysis of Qualitative Choice," Ph.D. disser- 
tation, Massachusetts Institute of Technology, Cambridge, Massachusetts, 
June 197 i. Chapter 1. 
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liencis the utility to student i of college alternative af.Aj i« given 
bv 



wlioro 'i is an n-vtTtor of parameters, and » , a veiUir of random 

ia 

monts; likewists the utility of b» B, is given by 

"ib V'i*V^*4b>» <2) 



where f is a vector of parameters and ^ a vector of random elements. 

I D 

The thoiee ot v from implies 

U, ^ iK , for all rSC.. (1) 
ic ie' I 



KSTIMATION 

Our ability to estimate the parameters of the utility function is 
limited by the nature of our observations. For each student* we know 
whethi*>r or not he went to college and, if so, which college he chose. 
We are also able to impute to him a set of feasible college alterna- 
tives Aj. We do not, however, have any information on the student's 
set of alternatives other than college, Bj . Because of this deficiencyt 
we have separated the student's decision into two stages: In the first 
stage, he chooses the best alternative in each of the sets Aj and B^; 
and, in the second, his choice between the two ^'winners'* determines 
whether or not he goes to college. 



The 5'^ Vr*'*Ji^!r5']^>'*^>*y- Model 

Since we have observations on the choice among alternatives in Aj 

(tor those students that did go to college), we are able to estimate 

the parami'ters of IK directly. 

A 

We assume that is linear in parameters with an additive 
disturbance: 



"ia-^^^'V • ^-^^^(Zj) • O^ + .j^, (4) 
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where in an ia-vt»ctor valued-function, is* an (n-in)-vec tor-valued 

tuuitlou; is partitioned into and accordinnly; and i . Is a 

i ^ la 

scalar raudom variable. The element V^(Z,) • 0^ represents the utility 
derived triMii college in general by Individual i in a way that does not 
depend i)n ihe qualities of a given college. The choice of a from Aj 
impl ios 

1 i a 1 2 I 1 la 
^ Vj(Zj,X^,) • + V^CZj) • t)^ + t^^, tor all a^Aj (5) 



or 



(Vj(Zj,X^) - Vj(Zj,X^J) • 0^ i Lj^, - £^^, for all a^LA^. (6) 

It should be clear from this that the choice amomg colleges docs not 
enable us to Identify and estimate the vector of parameters ^^2^ 

In order to estimate the parameters 6j by the maximum likelihood 
principle, it Is necessary to specify a joint probability distribution 
for the random variables > Unfortunately, this specification must 
be made on the grounds of computability since only one probability 
distribution is known to lead to a likelihood function of any simpli- 
city. That distribution is the WeibuU distrubut ion: 



•BT 

Prob(^ S T) « e • a > 0» 6 > 0 constants* (7) 

If f . and ^ .% are independent and identically distributed with this 
1 a la 

distribution, their difference has a logistic distribution with param- 
eter t^: 



1 + e 
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(;%*nerul Uin^ (8) to consider the Joint distribution of the differences 
( . - , •) ti>r all a*LA. and recalling (6) » we obtain 

I <-! I «.t 1 



Prc>h(.i i'ht)scn from A^) 



^ (9) 



1 + ) exp(-e(V,(Z,,X^) • V,(Z.,X J • QJ) 

jSA^.aVa 1 i a 1 i, a 1 



This tvpf ol modf»I — called a conditional logit model-^^vas developed by 
(janid McFadden^ who has shown that an estimate of the parameters that 
maximizes the likelihood of the observed choices is under certain quite 
general 1 conditions consistent and asymptotically normal*^ 

riie Co 1 lt*j^e;>Goin^ Mode 1 

The set of alternatives other than college faced by individual i^ 
Bp is not available to us« but we can capture something of a student *s 
decision on whether or not to go to college in the following way. Con- 
sider the function 

S(Z ,<)),5) = sup U.. . (10) 
^ brBj 

S is a sort of "envelope" function giving the level of utility attain- 
able by individual i when he chooses the best of the noncollege alter- 
natives. It is assumed that S is linear in parameters with additive 

disturhani'e: 



S(7-j.4^,^^j) - W(Zj) • 0 + (11) 



D. McFadden, "Conditional Logit Analysis of Qualitative Choice 
Behavior," in P. Zaremhk « (ed.)» Fvtyntievs in Ecvjnomotrio^\ Academic 
Press, New York, 1973* 
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wluTi* W Is d vfitor-valiu'd funotiun; <^ in n vector of parameters; and 
is a sialar random variable. 

11 a* Is the best rol lege-resideniy eombination (the chosen ele- 
nienl i>f A.), Hull individual I will enroll at a* it 



or 



Vi<^l-^a*^ • ^'l * ^2<^'i> • '^2 * ^ ia ^ * ^ * ^ 



(13) 



or 



(14) 



Betore discussinji the stochastic specification and estimation of 
the parameters of the col leRe-going model, we must point out three 
difficulties with Eq. (14). First, the functions ^2^^^ "^^i^ 
both defined over the same variables so that the coefficients and 
^ are not identified with respect to one another. We therefore replace 
the second expression on the left-hand side of Eq. (14) with the re- 
duced form Y(Zj) • A. Second, the best element of Aj is not known for 
students who do not go to college so that a. the predicted best college, 
must be used for these observations. Using a* for students who do go 
to college would cause a systematic measurement bias (since 
Vj(Zj.X.) • 'ij 2i Vj(Zj.X^^) • Hj) so that in order to avoid this type 
of bias a is used for all observations. Therefore, Vj(Zj,X^^) is re- 
placed by Vj(Z.,X.) in (14). Third. Uj is not known, but we do have 
an estimate B'<^ of isH^. Thus, inequality Kq. (14) is replaced by 

yUjj + Y(Zj) • A S 'ij, (15) 

where « Vj(Zj.Xg) • 30j; Y l« introduced to allow for the scale 
factor B in the estimate of 0^; and is a composite disturbance term. 
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By assuming a logistic distribution for the disturbance^ it is 
ptKSHihU* lo estimate the coefficients y and > using a iogit model and 
a maximum likelihood estimation procedure. 

It can be shown t\\at in the case of dichotomous choice , the 
stochastic specification is relatively unimportant and that a logit 
model will give a good approxim.it ion so long as the functional specif- 
ication Is sufficiently flexible. 
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* X". EMPI RICAL SPECIFICATION AND RESULTS 



In this section we describe the empirical application of the 
theoretical i r;imework developed above. A discussion of our procedure 
tor imput iuK sets of feasible college alternatives is followed by a 
presentation of the results ot estLmating the colleger-choice foodel 
and the college-going model. 

l^^UJJNiv JHE .^AT PI. £EASJ_Bl^^ COL LEGE ALTERNATIVES 

Our principal source of data, the SCOPE survey, provided us with 
information on whether or not graduating high school seniors went on 
to college; and, if it gave the college of enrollment. The SCOPE 
survey does not tell what other colleges the student was admitted to 
or might have been considering. Clearly* without such information no 
inference can be drawn about student preferences over colleges. Ue 
were forced to make up this deficiency as best we could. 

We made use of a number of auxiliary data sources in trying to 
reconstruct the set of feasible college alternatives for each student. 
These were the "Institutional Research File'* of the American Council 
on Education, the "Manual of Freshman Class Profiles (1965-67)" of 
the College Entrance Examination Board*^ and a file of geographical 
data that we gathered ourselves. 

The procedure for imputing the feasible set Involved three steps: 
(I) rejecting as many colleges as possible a priori; (2) predicting 
for the remaining colleges whether or not the student would have been 
admitted; O) predicting the level and composition of financial aid. 



Sohool to i'oLLetje: Opportunities for PosUteoondapy Eduoatian^ 
The Center for Research and Development in Higher Educatlont University 
of California at Berkeley. 

^J, Creager and C. Sell, The InQtitutional Domain of Higher^ Eduaa" 
tion: A i^hamoterii^tiaa File for liesearoh^ ACE Research Reports # Vol. 
4, No. 6, American Council on Education, Washington, D.C.^ 1969. 

^College Entrance Examination Board, Manual of Freshman Claas Pro^ 
files (1 ^€3^ 1 067). New York, 1968. 
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Each step is described below. The scttion ends with a discussion of 
a number ot ^vmral problems arising In the ustimatlon of this kind 
of cboiv'u nkulcl. 

Since iewer than 18 percent of the SCOPE college-goers attended 
coUeses farther than 200 miles from their homes, we felt that little 
information would be lost by excluding such colleges.* Purthernore, 
since very few students were observed to commute farther than 60 miles, 
it seemed reasonable to exclude colleges with no facilities for resi- 
dency on campus that were 60 miles or more away. 

In these calculations, we used the stralght-llne distance between 
the student's hiRh school and the college in question. Since students 
do n..t live at their high schools and since the straight-line distance 
way be a p^uir measure of the actual shortest route, our calculated 
distance is probably quite inaccurato for short distances but more 
reasonable for longer distances. We excluded from the sample all high 
schools having high percentage of boarders because the high school- 
college distance would not serve as a proxy for home to college dis- 
tance for hoarder-*. 

Ajdml_s_slon 

For each student, we took the set of all colleges not excluded on 
a priori grounds and simulated for each college in this set its admis- 
sion decision regarding that student. In order to do this, we needed 
an estimate of the probability of admission of a given student to a 
given college. The results of our estimation are presented below. 
Using this estimated probability, it was possible to simulate the 
collegers decision in the following way: a random number on the unit 
Interval was generated; If this number was less than the calculated 
probability, the college was included in the feasible set; otherwise 
it was not. In this way we ensured that if the calculated probability 



rnis process rejected clearly inferior colleges from the student's 
choice set. If, however, the student attended a more distant school, 
the attended school was included in the choice set. 



ERIC 
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ot admissi*ni was, say^ O.H^ wo iiu ltided the ri)llege in the foasible 
sft with priibahiUty O.K» 

I'Uv t»quai iuns prodictiuK the probability of admission were esti- 
mated iisiim data ♦ rom the CKKB's **M«umal of Fresliman Class Profiles 
(I4hh-ti7)/' This provides reiords of percentages of applicants accepted 
hy ability l^a^cd on their Scholastic* Aptitude Test (SAT) score and 
class r.ink* by sex, and by hijjh school control categories for over 400 
icilli*Kes« Wc based our c>st itmit ion of admission probability on the 66 

collc>;es whose tables show explicitly the interaction between SAT score 

t 

and rlass rank in determining the frequency of admission. We made no 
lifort III distinguish open enrollment schools from those with discre- 
tionary admission policic»s. 

Wc assumed that the frequency of admission was a function of the 
dcviatiiin ot tiie applicant's SAT score from the median SAT score for 
all students enrolled at the college« of his high school class rankt 
of the «ippiicant*s sex^ and of the type of high school he attended. 
In order to have a flexible sperif Icat lon« the continuous relatlve-SAT- 
score variable was introduced In a plecewlse-*linear form and interacted 
with the class rank cate>;ories. 

The results ot the estimation are presented In Table 1 and In 
Kig. 1. The probability of admission increases monotonlcal ly with 
relative SAT s<*ore and with class rank. The appearance of the graphs 
in Fi^. I indicates that SAT score and class rank do interact other 

The procedure we fitially used was slightly different from this. 

It is discusstul in Sec. IV. 
f 

These 66 colleges (predominantly private four*year c*.ol leges) 
wt»ri^ not representative of the whole spectrum of Americ«in colleges. 
In order t** test whether admission procedures differ systematically 
ai-ross colleges, we used less detailed dtita to estimate separate equa- 
tions tor *>ver 1 ')0 i-olle);t»K and then comparcni coefficients with college 
type. Some delinite systematic* variation was founds with public col* 
lt»gc»s rc»lyin^ less heavily on SAT scores for admission than private 
i-ollc'ges. However, the differences were sm^ill enough to warrant use 
of the equation tor the 66 colleges to forecast admission at all 
i'ol leges. 

^Although we reali/e that a probit or logit specification would 
have been superior, programming for such an algorithm had not been 
completed at this stage of the project. 
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Relative SAT score 

Fig J — Probobility of admission versus relative SAT score (using 
normalized dummy voriables, by class rank in quintiies) 

r 

than additively: the interaction is greater for students of lower class 
standings The regression lines for the first and second quintile stu*^ 
dentH are almost parallel* indicating an equal class rank effect for all 
SAT levels. However, for the lower quintiies* class rank appears to 
exercise a much greater effect on admission probability when the stu-^ 
dent*s SAT is close to the college's median score* 

The anymmetry of the admission«»probability curves about the origin 
may be ot some interest. With our piecewise«-linear specif Icat ion » all 
five curves are convex rather than S«-shaped« It seems that increments 
of SAT score have a greater effect on admission probability when the 
student is below the college's median than when he is above it. This 
may be a reflection of a skewness to the right in the ability distribu- 
tion of the applicants to any college. 



17 



KlnaiK ial^ Mil 

lu order to evaluate the aotual cost of attending a college, we 
attempted to predict the level and composition of the financial aid 
that would he ottered to each student by each college. There are two 
quite different ways of doing this: (1) by predicting the expected 
level OS .iid using a regression model; (2) by using a discrete proba-* 
hility model to find the probability of receiving aid, estimating the 
level or aid conditional on aid having been received* and then using 
a ^;imulatinn procedure to determine the aid offered to each student by 
each college. We were unsuccessful in our attempt to obtain accurate 
torecasts i>t tinanciat aid using either approach. 

Our model ot aid determination assumes that colleges made awards 
on the ha^is ot a student's ability and income, relative to that of its 
medi«Mi applicant for admission. The coefficients of our estimated equa*- 
tions were both small and generally insignificant. Among the possible 
4-auses of this result are: (1) general data inadequacies, (2) lack of 
a good specification oi the aid distribution process, and (3) the pos- 
sibility that colleges may have acted capriciously. 

(liven our failures* we did not Include a financial aid variable in 
our final specification for the college^-choice model, although we did 
attempt to i)hserve some of the possible effects of financial aid through 
the interaction i)f tuition and student income, and this is reported 
below. 

Stime Problems Associated with Est Imat ion 

The imputed choitc set of feasible alternatives will usually differ 
from the actual choice set, and this may be the source of a problem. 
The inchisi*ni uf colleges absent from the true choice sei but inferior 
to the I'ho^ i»n college will have ni> adverse effect on estimation since 
the chiu.t* would have been the same even if they had been present. On 
the i>thcr hand, the inclusion of superior or preterred colleges that do 
Uiit «ppt»ir in the t nu» choice* set will make it seem that a college with 
li»s'; oi tht desired qualities is chosen over one with more; this will 
imp.ii t a nk native bias tt) I lie coefficients of these qualities. For 
inst .nee, it our admission simulator ''admits" the student to a college 
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ol hi>»h aiailiMulr quality and thivstudt*nt is observed to choone a college 
of low .uadtmir <juallty» it will swrn that \w dislikes academic quality; 
wherea^i« in i ai t ♦ he mi^hl have preferred tfie lollege of hlRb acudemir 
qualitv hui it re.illy been a feaHible alternative. 

This problem is a^Ktavated if» as we believe, students have a posi* 
t ivt» prt'l t»rrn4 t» tor .icad<*mir quality up to some point but then begin to 
dislikt* si*ho4)ls in which academic levels are too tar above their own. 
It rhin tu'ronu*s almost Impossible to distinguish between such an effect 
and the bias introduied by the simulation of admission. The steps we 
have taken in an attempt to mlnimixe this bias will be described in our 
discussion ot the estim<ition of the college-choice model. 

The consequences of choice^set imputatiout both on the estimated 
coefficients of the utility function and on the associated reported 
standard errors» are still not fully understood. An alternative and 
theoretically sounder procedure would be to maximize a likelihood func- 
tion in which the choice set» as well as choice from a given sett is 
probabilistic. That is» let a* be student i's chosen college;! 4, a 
universe ot possible clu>ice sets* each including a*; and Pj(A) the 
probability thai A* c^l is the choiie set faced by student i. Then^ the 
marginal pn>habiUty that a* is chosen is 



Pjt i*) " 2^ »'i<A)Prob(Uj^^ i: V^^^, all a^A). (16) 
A» .1 



and t Uv 1 i kt* 1 i hood f unc t i on i s 



f][pi(a*). 



I- - I I P,(a*). (17) 
i 

Wt» ionsldered estimation of n through maximisation of I. but re- 
jected it as i pr.utical procedure because of its prohibitive compute^- 
tional exp»»nsi»» It I. Is accepted as the correct likelihood function, 
then tlie tum iitMi we did maximize should be Viewed as a quasi-1 ikel ihood 
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tuni tiiin, niu* in whiiii a •'certalnty-fqulvaleni" ihoice stft replact^B the 

* 

«ti'tu«il distrihut ion ot possible nets. 

Anotiu*r problem arises because the conditional logit model attsumes 
inJepenilefit ly distributed disturbances* To utiderstand the probleiSt con** 
sider a ihoice between a private university and a community colleget with 
the probability i>f ihiHisiuK the former 0.6, and that of choosing the 
latter, 0.4. Now introduce a second identical community college into 
the chc>ii'e set. In the conditional logit mod^^ the probability of choos- 
ing the university becomes 0.43; for choosing each community colleget 
Thus, *he total probability of choosing r><'^ni coimnunity college 
now exceeds that ot choosing the university-^^the reverse of the original 
si t uat ion. 

The source of the problem in the above example is an improper as- 
sumption ot independent disturbances in the stochastic utility function* 
Civen that the two community colleges are identical* their disturbances 
(which are due to omitted variables) should also be identical. In prac- 
tice, alternatives are rarely identical but may be **simllar«** in which 
case the problem persists if their similarity extends to unobserved 
dimensions. In order to minimize the amount of unobserved similarity 
amon^ college alternat ives» and hence to make the assumption ^£ inde- 
pendent disturbances plausible* we augmented our set of college attri- 
butes with a set of college-type dunny variables. It was hoped that by 
usinK duninies to pick up choice-relevant features that each college type 
shares (which were not included among our <ixpllcit college attributes) t 
the remaining unobserved attributes would De independently distributed. 

Our imputed rhoice«*sets contained a large number of colleges (most 
included t rom SO to 150 schools) so that computational considerations 
toned us to find some way of decreasing the number. We decided to select 
ten i'ol legt»s at random from the imputed choice set of each student and 

estimate usin^ this random subsample plus the college actually chosen 
as the choice* set* We found th«it changing the randomization (i.e.t « 

*Lack of data regarding ohoice*-set composition is a cotuaon occur-* 
rencc in current data sources. A formal analysis of the consequences 
of choice-set imputation procedures and other means of handling the 
problem would be a useful contribution. 
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dravluK different subsample rholce sets fur each student) had almost 
no effei*t on the estimation results* 

Lai-h college in a student choice set actually embodies two alter-* 
natives, i or respimd ing to the two residency choices of coosnuting or 
living on campus. Kecall the college utility function given in Uq. (4): 

in this equation **a** should be taken as a college«*residency combinatlon« 
not simply as a college. Because for a given college many X attributes* 
and consequently many functions* are invariant to the residency 
option* it becomes feasible to separate estimation of the part of the 
utility function relevant to re^>Ldency choice from that of the remainder 
of the function. Prior estimation of the residency model was performed; 
it was useful because it allowed us to forecast a student *s hypothetical 
residency choice at each college and hence to trim the size of the choice 
set in later estimations. 

The residency-choice model was formulated as a dichotomous logit 
model and was estimated using a subsample of the SCOPE survey (students 
attending a college at which they could have chosen either alternative). 
The results are presented in Table 2 and Fig« 2. 

The first four coefficients in Table 2 are for the interaction of 
distance and income (measured in log to base 10). If X is a distance 
value and Y is an income value (measured In log to base 10) « then the 
following formulas are used to determine a probability of campus resi- 
dency for that distance-income combination. Variable (1) « (100-X)(5-Y)/500; 
(2) » X(5-Y)/500; (1) » ( 100-X)Y/500; and (4) » XY/500. It is difficult 
to interpret the coefficients of a piecewise-linear specification* but 
Fig. 2 should make this easier. Tracing the probability of campus 
residency as a function of distance for three income levels* the graph 

In some cases « more than two such choices may be available • How-* 
ever* for college freshmen* this is rarely so. 



Table 2 



KSTIMATEU CUEFFICIENTS OF THE RESIDENCY CHOICE MODEL 



Variable 


ti •in So n f 

Values 


Coefficient 


Error 


Ulstance-Log Income Interaction 
Distance Ia;^^ (Income) 

100 0 
0 5 
100 S 


ft - 1 
0 - 1 
0-1 
0-1 


-8.680 

-0.6277 

10.81 


1 71 

0.90 
0.40 
0.70 


Percentage Dormitory Capacity^ 


0 - 100 


0.01929 


0.0023 


Student Sex (if female) 


0-1 


-0.04789 


0.11 


Residency Preference^ 
(1 * on-campus) 


0-1 


1.470 


0.16 



This variable may reflect the campus character of the school • 
It also may be interpreted statistically as a mixture probabilityt 
where the kernel distribution is the choice probability condi- 
tional on college and student attributes and the mixing dlstrlbu^-- 
tlon is the distribution of such attributes. 



The SCOPE question asked the student what his residency deci* 
slon would be if money were not ^ problem. Thust it should capture 
the pure preference for living style. 



shows that the probability Increases with distance and is higher at each 
distance for the student with higher income. 

The interpretation of the remaining coefficients is straightforward. 
The probability of campus residency increases with percentage dormitory 
capacity, is very slightly higher for males than for females* and is 
higher for those students who» all else equal* would prefer to live on 
(!ampus« 

This model of residency choice may be of some Interest beyond its 
use here, since an understanding of this aspect of student behavior is 

It is, of course, not surprising that choices should reflect stu- 
dents* pure. preference for residency mode. It is interesting* though* 
how small a role pure preference plays compared to the economic variables 
of distance and income* 
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Oi$tonce (miles) 



Fig.2 -~Probabil!ty of living on campus (for males attending a college with 
50 percent dormitory capacity ond who prefer to live on campus) 

needed in res^olving rertain issues in higher education policy* As an 
example^ one could citv the current controversy over the location of 
cooununity colleges. It is often said that such colleges should be built 
in the city to provide morv readily available education for the urbat« 
poor. The arfttiment ptit forward for this policv — one of bringing the 
college to the student — is that connnuting over short distances results 
in lower out-of-pocket costs than campus residency. Therefore* it is 
asserted* the availability of nearby community ctU leges will encourage 
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».nronmtnt by low-lntona. students. Tliere Is, however, another possible 
solution: to buihl i-ampus-type tonmunity colleges on low-rost, non- 
urban land and to subsidize the on-campus living costs of poor students. 
It it Is tound that students Strongly pri-ft?r on-campus residency to com- 
muting, thtn thf latter alternative, not the former, would provide the 
better solution to the college location problem. 

KHTl^VVVlNli. rHE^^COM^KUKjaK^^^ 

The rollege-choiee model, apart from t' e residency-choice component, 
was estimated using a subsample of the SCOPE survey that included stu- 
dents uraduating from Illinois high schools who went on to enroll in 
some lollege. This subsample contained some 3000 observations. These 
observ. ions were divided into three income strata of approximately equal 
size, and the model estimated separately for each stratum. This allowed 
full interaction between income and all other variables. 

The first subsection below describes the income variable used to 
carry nut this stratification. The next four subsections deal with the 
four types of variable Included in the specification of this model: 
(1) variables relating to the cost of a college, (2) those relating to 
academic quality. (3) those relating to the "quality of life," and (A) 
dummy variables for college type. 

The model was reestlmated on a similar subsample of seniors gradu- 
ating trom North Carolina high schools to see whether behavior was 
reasonably uniform in different states and hence whether our estimated 
coefficients would be of any use in predicting behavior outside the 
sample. The results are described in the final subsection. 

Income Variable 

The difficulties associated with the reporting of family income in 
the SCOl'K survey have been discussed at length in an unpublished paper 
by I.. S. Miller. We decided to use family income reported by parents 
minus a deduction for family size as our income variable. We were 
forced to predict family Income on the basis of other variables, however, 
because of the large number of observations for which these data were 
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misslnK, B*.iausf our incooie predictor is quite similar to Miller's* 
and Is ot little intrinsic interest, we do not report the details here. 
The deduction for family size was made on the basis of the family size 
allowance used by the College Scholarship Service (CSS). The CSS bases 
its allowance on the impact of additional children on the "moderate in- 
come level" of the Bureau of Labor Statistics. We used this deduction 
to control for effects of children on disposable income available for 
college expenses. 

Cost V ariables 

We view the student as comparing different colleges on the basis 
of their costs and benefits, and our coefficients are intended to 
capture the revealed tradeoff between these costs and benefits. 

The principal cost variable is college tuition. Although finan- 
cial aid should be deducted from tuition to give mt cost, our failure 
to find a satisfactory method of predicting financial aid prevented us 
from doing this. As a substitute, we introduced a term quadratic in 
tuition, the rationale being that the burden of tuition would rise less 
than linearly with the amount, particularly for poor students, since 
hlgh-tultlon colleges are also the ones most likely to offer financial 
aid. The coefficients of variables 1 and 2 in Table 3 seem to bear 
this out. Figure 3 shows the disutility of tuition for the three in- 
come levels: the curves become lower and flatter as income rises, 
exhibiting decreasing curvature as well. 

If the student "chooses"* to live at hone and to conmute to a par- 
ticular college, then variable 3 is set equal to the distance from home 



The only major difference is our use of an additional variable: 
the median reported income for parents of students In the particular 
high school. Income was predicted even for those observations where it 
was reported because we felt that the predicted value was probably a 
better measure of permanent income. The use of this variable as the 
basis for a crude stratification rather than as a continuous variable 
in the equation Itself reduces the importance of precise prediction. 

It also prevented us from comparing the value of different types 
of aid (fellowship, loan, work-study), something we had hoped to do. 

The result of our simulation of the residency decision. 
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Table 3 



LHTIHATEO COEFFICIENTS OF THE COIXEGE-CHOtCE MODEL 
(Illinois data) 







Units (approa- 


Coefficients, by Incone 


Stratiw 






imate range 






> >ll,47u 




Variables 


ot values) 


< $7.B60 


$7,860-11.470 


Cot»t 










-0al05 


1. 


Tuition, $ 


100 


-0.397 


-0.285 






(0-40) 


(0.026S)' 


(0.0247) 


(0a0l96) 


2. 


(Tuition)^, S 


(100)2 


UaUUo«li 


UalNIO^U 


0.00242 




(0-1600) 


(0.000882) 


(0 a 00090 7) 


(0.000560) 


3. 


Distance from home to 








-0.0254 




college. If coonut-* 


Miles 


— wa V 


«0 0532 




Ing, 


(0«-200) 


(0.00749) 


(0 a 00946) 


(0.0124) 




Room and board. If liv*^ 


100 


-0.186 


-0.136 


0.00589 




Ing on campus, $ 


(0-30. aporoXa) 


(0.0143) 


(0.0139) 


(0.0150) 


Academic Quality 






1.02 


1.63 


3. 


Average student ability 


100 SAT points 


l.OB 

A a W 




at college 


(2-8) 


10 1231 

\wa ^mt^i 


(0.113) 


(0.107) 


6. 


Ability difference^ 


(100 SAT points) 2 


•0.263 


-0,415 


-0.298 




(0-36) 


fn n^4^i 


fO 05441 


(0.0498) 


7. 


College revenue per 


1000 


«.0 0197 

— Va VA7# 


«^.0694 


-0.0529 




student, $ 


(X-4) 


fO 0337) 


(0.0293) 


(0.0254) 


8. 


Breadth of offering. 




0.125 

Wa 


0.183 


0.0934 




Index 


(1-13) 


\Va M^^H^ 


(0.0261) 


(0.0242) 


Quality of Life 








-0.698 


9. 


Coeducational college 






7flQ 




(0.1) 


(0.183) 


(0.149) 


(0.127) 


10. 


Dormitory capacity, X 




-0.00109 


-0.00628 


-0.00108 




(0-100) 


(0.00200/ 


\UaUU^ JXJ 


(0.00206) 


Duanles 










11. 


Private university^ 


(0.1) 


0 


0 


0 


12a 


Private 4-*year college 




-2.33 


-1.77 


-1.55 




(0.1) 


(0.203) 


(0.166) 


(0.145) 


13* 


Private 2*year college 


-4.06 


-2.26 


-0.843 




(0.1) 


(0.444) 


(0.382) 


(0.374) 


14a 


Public university 


-1.6S 


-1.19 


-0.0579 




(0.1) 


(0.207) 


(0.123) 


(0.14A) 


15. 


Public 4«year college 




-2.42 


-1.83 


-0.675 




(0.1) 


(0.232) 


(0.200) 


(0.187) 


16. 


Public 2-year college 


-1.29 


-0.361 


0.0279 




(0.1) 


(0.332) 


(0.322) 


(0.342) 


Number 


of observations 




997 


990 


1028 


Maximum log** likelihood 




-1410. 


-1650. 


-1770. 


Log* likelihood for ^ • 0 




•2280. 


-2310. 


-2420. 



^Numbers In parentheses are asymptotic standard errors. 



(Average SAT score at college-student SAT)2a 
Normalization* 
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5 




0 2 4 6 8 10 12 14 16 18 20 



Tuition ($ hundreds) 
Ffg.S — Disutility of tuition 

to college; otherwise it is set equal to zero. Slmilarlyt if the stu- 
dent chooses to live on campus* variable 4 is set equal to the cost of 
room and board at that college; otherwise it is set equal to 2ero« Thus« 
a college is **debited" with transportation costs* as represented by the 
distance « if the student commutes* and with the cost of room and board 
if he lives on campus. 

The roo0-and -board variable may also be interpreted as an indicator 
of ''quality of life** in addition to its role as a cost variable: Higher 
charges for room and board may* in some cases » Indicate better living 
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vtMidlt l«»ns. Thi- t lU'l t ii unt s sovm to Hupport this i inu lusion nlnce thv 
vtttM t ut a -iilOO iiu riMHf In ronni-und-boar«l ftM'« has less impact than 
.i Himil.ir liuro.ts.' in tuition (txit-pt at high levt'ls »>f tuition). Fur- 
t»u'nnor«.-» thv . tu-t I io itnt is pijsitivi- for thf hlRlifst im-orae stratum. 

Bv vomp.irin« the thanK^s in utility resulting from marginal changes 
ill luitioti and in thi- distance from liomi' to loUfKC, It is possible to 
t ali ul.ii f the implicit evaluation, in money terms, of a mile traveled, 
l or I lie low-im oine stratum this is about $0.05 per mile; for the middle- 
itUi>me stratum, about $0.11, and for the high-income stratum, about 
$0.17.* 

Aiademii- t^oality Variables 

The average Scholastic Aptitude Test (SAT) score for students at- 
tending a college was used as one measure of academic quality (variable 
■>). Although we believed that students would prefer colleges with 
higher average SAT scores, it seemed probable that a student would not 
wish to attend a school where the average ability was too far above his 
own. In order to capture this effect, we Included variable 6, a measure 
ot the distance ot the student's ability from tliat of tlie average student 
at the lollege. Thus, the total Impact of average SAT is obtained by 
«omhining its effects through variables 5 and 6. 

As we mentioned in our discussion of the problems associated with 
imputed choice sets, it ir. impossible to isolate this aversion for 
schools ot too high an academic level from the bias caused by our slm- 
ulat ion of c«>nege admission. We did, Itowever, conduct some experiments 
to determine the sensitivity of the results. This was done by excluding 
altogether from the choice set those schools with a predicted admission 
probability helow a certain level. These experiments had little effect 
on the coefficients of variable 5, but the coefficients of variable 6 
till s.imewhat in absolute value as the cutoff level was raised from 0 
to O.JS and almost disappeared as the cutoff level was raised again to 

*Thls Is calculated on the basis of 150 round trips per year. The 
marginal dis-utillty of tuition taken at $1000. 

It shield be noted that distance also enters as an explanatory 
variable In the residency choice model so that its full impact on stu- 
dent behavior is divided between the two models. 



28 

O.S. Other toefflt ients were minimally affected by these experiments. 
The coot t itientH reported in Tables i and 4 are for an admission cutoff 
level of 0.2'>, whiih we felt was probably sufficient to counter t:he 
bias. (This is, (it course, only a k"u8s.) 

We had hoped that variable 7, college revenue per student, would 
be a proxy for educational expenditure and ^o an indicator of academic 
quality. However. It proved to be of little statistical or economic 
siRnif loanre, and the coefficient had the "wrong" sign. 

Variable 8, breadth of offering, is an index constructed by us from 
the list ot fields, for each college, in which a bachelor's degree is 
offered. As expected, this variable had a positive coefficient showing 
that students preferred schools offering a wider choice of possible 
specializations. This preference seems to be stronger in the middle- 
Income stratum than in the high and low strata. 

<I"aJLlti'- of~Life Va riables 

Since we believe that studenjts view college at least in part as a 
consumption good, we tried to capture the value of a college In this 
respect through variables representing the "quality of life" at the 
school . 

Variable 9 is a dumny variable set equal to unity if the college 
is coeducational. Contrary to our expectations, the coefficients proved 
to be negative. Variable 10 is the dormitory capacity of the collcjie as 
a percentage of enrollment; it was included as an indicator of the degree 
to which the school was a campus rather than a community institution. 
This variable had little influence on decisions. As we mentioned above, 
variable 4, fees for room and board, may also be Interpreted as a quallty- 
of-llfe variable. 

College-Type Dummy Variable 

Variables 11 through 16 are a set of dummy variables for college 
control and type. Any behavioral interpretation of their coefficients 
is problematic because they capture the influence of a combination of 
unobserved variables. Our use of college dummies is similar to the use 
of mode-specific variables in transportation-choice studies. 
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R>>*st inmjon^^^^ North Car oHna D ata 

The i oef J ii*ii»nt s of the col lege*4*hoii*i* model estimated from North 
Carolina dat.i are presented In Table 4. Although there arc differences 
from the Illinois results — most notably for variable 1 — and while a 
4*lassii*al statistical test for equality of the two sets of coefficients 
would failt the similarity between the two sets of behavioral coeffi- 
cients (variables 1 through 10} indicates that there may be considerable 
uniformity in the behavior of students in quite different geographical 
areas. It also suggests that our results are rather better than might 
have been anticipated in view of all the simulation and imputation of 
crucial variables. 

l^STJMAXINC THE COLLEGE-GOING MODEL 

The college-going model was estimated from a subsample of the SCOPE 
survey consisting of all Illinois high school graduate8«^«^both those who 
did enroll in some college (these were also used to estimate the college-^ 
choice model) and those who did not. The subsample contained about 7000 
observations* which were divided according to the same Income strata 
used in the college-choice model. The estimated coefficients are pre- 
sented in Table 5. 

Recall the structure of this model* described by Eq. (15) and re«* 
written below: 

>Ujj + Y(Zj) • X 2t n^. 

The principal variable^ variable 1 in Table 5« is the utility of the 
best college available — that is* U^^* This is taken to be the highest 
utility of any college in the imputed-cholce set when the utility is 
calculated using only the behavioral coefficients (i.e.» variables 1 
through 10 in Table 3). We tried to calculate the highest utility 
using the college*-type dummy variables as well* The likelihood was 
increased* but not by very much. Consequently t we excluded the dunnies 
since their behavioral significance was doubtful and their contribution 
here small. Variables 2 through 22 are the Y functions of Eq. (15). 
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Table 4 

tSTIHATED COEKFlriENTS OK THE COLLEGE-CHOICE MODEL 
(North Carolina data) 



Variables 



Unit Is (approx' 
imate range 
of values) 



Coef f icientM, by Income Stratum 
^ $7,860 I $7,860-11.470 I > $11,470 



Cost 
I. 

2. 

i. 



Tuition. $ 

(Tuition)^. $ 

Diatance from home to 
college, if connut- 
ing, 

4. Room and board, if liv- 

ing on campus. $ 

Academic (Quality 

5. Average student ability 

at college 
6 Ability difference^ 

7. College revenue per 

student. $ 

8. Breadth of offering. 

Index 

Quality of Life 

9. Coeducational college 

10. Dormitory capacity. X 

Dummiea 

11. Private university^ 

12. Private 4-year college 

n. Private 2-ycar college 

14. Public university 

15. Public 4-year or 2-year 

college 



100 
(0-40) 
(100)2 
(0-1600) 

Miles 
(0-200) 
100 

(0-30. approx.) 

100 SAT points 
(2-8) 
(100 SAT points)^ 
(0-36) 
1000 
(1-4) 

(1-13) 



(0.1) 
(0-100) 

(0.1) 
(0.1) 
(O.l) 
(0.1) 
(0.1) 



0.250 
(0.0722)* 
-0.0237 
(0.00479) 

0.0239 
(0.00419) 
-0.162 
(0.0121) 

0.837 
(0.0639) 
-0.61S 
(0.0406) 
-0.0920 
(0.0377) 

0.0844 
(0.0153) 

0.795 
(0.156) 
-0.00936 
(0.00152) 

0 

-1.1.9 
(0.276) 
-1.31 
(0.343) 
0.000711 
(0.247) 
-0.00576 
(0.271) 



-0.174 
(0.0503) 
0.00250 
(0.00271) 

0.0239 
(0.0100) 
-0.141 
(0.0199) 

1.89 
(0.116) 
-0.549 
(0.0629) 
-0.112 
(0.0542) 

0.134 
(0.0276) 

0.634 
(0.171) 
-0.0108 
(0.00215) 

0 

-0.596 
(0.353) 

0.724 
(0.499) 

0.124 
(0.265) 

0.244 
(0.348) 



-0.0952 
(0.0440) 
0.00176 
(0.00159) 

0.00758 
(0.0144) 
-0.0987 
(0.0192) 

2.13 
(0.119) 
-0.428 
(0.0574) 
-0.0786 
(0.0513) 

0.102 
(0.0277) 

-0.0229 
(0.148) 
-0.00337 
(0.00219) 

0 

-0.0951 
(0.354) 

1.26 
(0.509) 

1.03 
(0.267) 

0.778 
(0.359) 



Nuaber of observations 
Maximum log-likelihood 
Log- likelihood for 6 • 0 



1623 
-2810. 
-3860. 



749 
-1250. 
-1790. 



760 
-1240. 
-1820. 



niuBbtrs In parentheses are asymptotic standard 
(Average SAT score at college-student SAT) 2. 
^Normal isat ion. 



errors. 



labU- ') 







ApproKimate 


<\u'ttii ifuts. bv Ini'iHDi' Stratum 


















Kauge of 












V.I lues 




S7.rt6(>-n ,470 


• $11,470 
















M i I it V ot hfst 


(•H to ♦«) 


l.OM 


(».H7h 


0.97 3 








(0.0»»27)'* 


(II.0SH5) 


(0.06H8) 


I It lu r ' 


^ vx\\\K .It 
















-0.0 j4f) 


-0.0487 


0.5H8 






(0,1) 


(0.21 J) 


(0. ) i9) 


(0.770) 




Fintstii'd gradt' school 




-0.0975 


0.148 


0.974 






(0,1) 


(0.210) 


(0. 319) 


(0.636) 








0.163 


0.400 


1.21 






(0,1) 


(0.209) 


(0. 110) 


(0.596) 








0.601 


0.537 


1.46 






(0,1) 


(0.2J0) 


(0.313) 


(0.592) 




Si>nR» collv^t* (or other 




0.805 


0.577 


1.80 




poHt high school) 


(0,1) 


(0.358) 


(0.373) 


(0.596) 




F i n i shvd 1 1 egt^ 




2.79 


1.14 


1.84 






(0,1) 


(0.839) 


(0.453) 


(0.621) 




M.»«4t%»r's di'gret* 




2. 33 


1.53 


1.79 






(0,1) 


(1.04) 


(1.06) 


(0.628) 


4. 


Doi tor * s <U*Kre«* 




-0.406 


-0.780 


0.279 




Not reported^ 


(0,1) 


(0.312) 


(0.407) 


(0.689) 


10. 


(0,1) 


0 


0 


0 


Mother' 


s educat Ion 










1 1 . 


^ume ^r<tdi* school 




O.SSfl 


0.935 


-0.476 






(0,1) 


(0.283) 


(0.S64) 


(1-15) 


U\ 


FlniHiied >:rade school 




0.685 


0.910 


-0.921 






(0,1) 


(0.270) 


(0.490) 


(1.09) 


1 { 
1 J • 


r»l>nH;» III ^11 SCn<><>l 




0.698 


1.29 


0.0163 






(0^1) 


(0.273) 


(0.485) 


(1.07) 


u. 


Finished high school 




1.26 


1.83 


0.624 






(0.1) 


(0.290) 


(0.490) 


(1.08) 




Som*» college (or other 




1.35 


1.94 


0.640 




povit high school) 


(0.1) 






\ I •UOJ 




Finished colleRe 




1.97 


1.34 


0.75e 






(0,1) 


(0.881) 


(0.662) 


(1.13) 


i;. 


Master's degree 




0.216 


0.453 


1.11 






(0,1) 


(1.53) 


(1.50) 


(1.57) 


IH. 


Dor tor's degree 




0. 360 


0.627 


4.18 




Not reported^ 


(0,1) 


(0.395) 


(0.597) 


(1.17) 


19. 


(0,1) 


0 


0 


0 


StU'ItMlt St»X 












Malc^ 


(0,1) 


0 


0 


0 




Female 




0.448 


-0.263 


-0.0422 






(0,1) 


(0.0883) 


(0.102) 


(1.40) 




r oitstant 


1 


-6.00 


-5.10 


-8.80 








(0.387) 


(0.562) 


(1.33) 




of observations 




3436 


2177 


1493 


Maximum log*l ikt*! ihood 




-1540. 


-1150. 


-661. 


UiH^ like 11 hood for • n 




-2380. 


-1510. 


-1030. 



Numbers in parentheneft are aaynptodc standard errors* 



Norsaliaation. 
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Vuriablt ^i J throu^ih 10 In Table S are a set of dummy variablen rcpre- 
!^tMUinK the edurat ion of the student*s father. Variables 11 through 19 
are a set ot Uuinmy variables representing the education of the student's 
mother. The t'tttut of the father's education seems to be greater than 
that of the mother's. There seems to be a lump'' In the probability of 
the student attending college if the father has completed college (or 
had some cul Ivge for the high-income group) and if the mother has com- 
pleted high Sihool. Tliere is; a decrease in the importance of parental 
education as family income rises. 

Variable 22 is a constant t and the differences between the values 
ot its coefficient tor the three income strata represent "pure" income 
effects. Variable 1« the utility of the best college, already accounts 
for all of the effect of family income on the availability and attrac- 
tiveness of college alternatives. The "pure" Income effect here repre- 
sents the effect of a change in family Income when all other variables, 
including the utility of the best college^ are held constant. 

The hIzv of the difference in the constant coefficients depends on 
the normalization of the other dummy variables: The values in Table 5 
are for male students reporting neither father's nor mother's education. 
If we normalise on male students whose fathers have finished high school 
and whose mothers liave finished grade school, the constants become -A. 7, 
-3.7, and -7.i. The same kind of differences occur in the constant as 
income rises, regardless of the normalization: The probability of going 
to college rises as we go from the low-Income stratum to the middle- 
income stratum, and then falls sharply as we go on to the high-income 
stratum. This result does not contradict the fact that a higher propor- 
tion of students from high-income families go to college than those from 
low-income families. It does Imply, though, that this pattern among 
enrollment rates is largely explained by the existence of more attrac- 
tive college alternatives for students from high-income families. This 
relative attractiveness depends, in part,, on the lower disutility of 
tuition for higher-income students. If faced with equally attractive 
college alternatives (in the subjective sense of equal utility), the 
high-income student will be less likely to go to college. Furthermore, 
in the light of our previous theoretical discussion, the result is not 
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really surprising situ*(» high-income students probably face a more 
attractive set of nonrollege alternatives. 

The i ollcge-Koing model was rcestlmated with similar North Carolina 
data. Tlic pattern of results did not differ substantiallyt except that 
the parental education effects were weaker^ and the mother's education 
seemed mure important than the father's (see Table 6). 
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Table 6 

CSTIMATKD COEFFICIENTS OF THE COLLECE-GOING MODEL 
(North Carolina data) 







ApproKlttate 


Coefficients, by Incoac Stratua 






Range of 










Variable^ 


Values 


< $7,860 


$7,860-11,470 


> $11,470 














Utility of h^At 












col l6|j^e 


(-8 to -fS) 


0.936 


1.03 


0.841 








(0.039)* 


(0.0705) 


(0.0659) 


FatH«»r 


*tt education 










•% 


Soiae grade school 




0.278 








Plnitihed srade Brhool 


(0.178) 


(0.593) 






0.282 


1 22 

A • ft* 




4. 


Soae hish school 


(0.1) 


(0.123) 


(0.539) 


(1.76) 


(0,1) 


0.876 


1.30 


0.00563 






(0.122) 


(0.525) 


(1.23) 


Finished hl&h flchool 




0.994 


1.57 


0.429 




SoAie collese lor nfhAr 


(0,1) 


(0.145) 


(0.530) 


(1.23) 


• 






post high school) 




0.889 


, 

1.88 


0.576 


7. 


Finifihftd rnl leff^ 


(0,1) 


(0.217) 


(0.557) 


(1.23) 




1.26 


1.69 


0.500 


^. 


Master's degree 


(0,1) 


(0.397) 


(0.663) 


CI. 26) 




0.246 


3.30 


0.742 


9. 




(0,1) 


(0.728) 


(2.56) 


(1.28) 


Doctor *s deSree 




-0.09S3 


0.354 


-0.868 


10. 




(0,1) 


(0.1S8) 


(0.599) 


(1.34) 


Not r*eDorted^ 


(0,1) 


0 


0 


0 


Mother 


H education 










i i • 


Some grade school 




0.0324 


1.21 


-1 OS 


12. 


Finished srade srhnnl 


fn 1) 


(0.180) 


(0.808) 






0.319 


-1.38 


-0. 772 


I i* 


SoOBe hifth «4chool 


fn 1 ) 


(0.167) 


(0.743) 


(1.10) 




0.870 


-0.770 


1.06 


lA. 


Finished high scliool 


(0,1) 


(0.164) 


(0.734) 


(1.04) 




1.S2 


-0.345 


1.29 


IS. 


Some college (or other 


(0,1) 


(0.184) 


(0.742) 


(1.05) 




2. OS 


0.364 


1.57 


16. 


post high rtchool) 


(0,1) 


in 9 in) 




(la05) 


Finished college 




1.54 


0.457 


2.04 


17. 


Master's degree 


\0,1) 


(0.467) 


(0.998) 


(1.12) 






0.120 


1.68 


16* 




(0,1) 




(1.49) 


(1.43) 


Doctor *s degree 






— X . / J 


n SAC 


19* 




(0,1) 


(2.45) 


(0.860) 


(1.20) 


Not reported*^ 




0 


0 


0 


Student 


sex 










20. 


Male^ 


(0,1) 


0 


0 


0 


21. 


Female 




-0.053 


-0.0741 


-0.394 


22. 




(0,1) 


(0.0668) 


(0.135) 


(0.171) 


i^onstant 


1 


-6.58 


-9.35 


-8.93 








(0.250) 


(0.925) 


(1.42) 


Ntimhcr 


of observations 




6389 


1453 


1096 


Maxlmutn 


log^likelihood 




-0.281 X 10* 


-0.694 « lo' 


-0.457 X 10-^ 


Toe <^ likelihood for e • 0 




-0.443 » 10* 


-O.lOl K 10* 


-0.760 « 10^ 



Nttmbers In parentheses are asymptotic standard errors. 
^Normalization. 
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V. A S IHPLE EXAMPl.E OF THE USE OF THE„RESJJLTS^ }^IP^^?JJ^9 



The purpose ut our study has been to investigate and estioate the 
impacts ot student and institutional attributes on college-*going be- 
havior and to describe the use of these estimates in predicting the 
effect ot alternative higher education policies. All too often* in- 
vestigators arrive at a set of estimates or coefficients and leave the 
use ot their results entirely in the hands of others* In most Instances 
this leaves the estimates either unutilized or misused. To avoid this 
problem* and to fulfill partially the second goal of our study* this 
section presents a simplified simulation of the results of the choices 
facing a student. In our simulation* we forecast the behavior of a 
student facing a choice among three alternatives; choosing a public 
university* choosing a public tvo«*year college* and not enrolling. The 
simulation follows the three stages of the estimation process. The 
public policy question is where to locate the two-year college given 
the already existent university. 

The student in our simulation is a male with a Scholastic Achieve- 
ment Test (SAT) score of 500* which places him approximately at the 
median for high school graduates. His family income is $6000; his 
father graduated from high school; and his mother has some high school 
education. He prefers living on campus to living at home and coDmutlng. 
He is an Illinois resident. The role of these various student character 
iHtlcs is described by the appropriate coefficients in Tables 2* 3* and 
S. Tlie attributes of the two colleges are as follows: 







Public 




Public 


Two-year 


Variable 


University 


College 






$200 




$1000 








4 






$1000 






500 






Yes 






02 






(Variable 




0 miles 
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The first stage in the simulation is to estimate the probability 
that the student will live on campus given the various distances from 
home to college. Using Table 2 and Eq. (16), we arrive at the follow- 
ing estimate of the student's residency choice: 



Public 
Public Two-year 
Variable University College 

Probability of 
living on caaqtus 



Using these estimates, we assign the student to commuter status in each 
institution for the remainder of the simulation. 

In the second stage of the simulation, we estimate the attractive- 
ness (or utility) of each institution for the particular student. This 
is done using the equation U • V • 6, where 6 is the appropriate vector 
of coefficients from the college-choice equation (Table 3) and V is the 
vector of institutional attributes. The results of the utility calcula- 
tion are shown in Table 7. 



Table 7 

UTILITY OF COLLEGE ALTERNATIVES 





Utility 






Public 


Distance 


Public 


Two-year 


(miles) 


University 


College 


0 


3.80 


4.35 


10 


NA 


4.04 


20 


HA 


3.72 


30 


NA 


3.41 


40 


MA 


3.09 


50 


NA 


2.78 


60 


NA 


2.47 


70 


NA 


2.15 



NOTE: NA • not appropriate. 



For :sil distances, given zero dormitory capacity. 



0.108 
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If wi* were Cinucrtud with the enrollment effects uf the two-year college 
.ilone <i.i*.« assuming either that the university did not exist or that 
the student wuuld not be admitted to it)^ the third stage of the slfflula^ 
tiun would involve t ranslormat ion of the utility estimate into an enroll-^ 
ment probability. This would have the logistic form 

w 

P (enrollment) « , 

1 ^ e^ 

where w = yU + YA. The results of this calculation are shown in Fig. 4« 




0 20 40 60 80 



Distonce from home to college (miles) 

Figt4 — Probobiltty of enrollment versus distonce 
for student focing two-^yeor college 

The third stage of our simulation (of the effect of the tvo*-year college 
location decision on enrollment rates and on a student's behavior pat«- 
tern when he lives next door to a university for which he is eligible) 
has two parts. In the first part^ the enrollment rate is calculated in 
the same way as in the single-college models except that the best col- 
lege (i.e.« the one with the highest utility) is entered into the prob- 
ability of enrollment equation* Consequently ^ the two-year college 
stimulates enrollment only when it is at a distance at which its utility 
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vxctfvdti that ot tht- i loso-by university. In the ca«e of our example, 
the utility of tht- two-year college exceeds that of the university when 
thf two-year Institution is within 20 miles of the student. The re- 
sult ot enrollment rate calculation is shown in ¥i^. *>. 




Ol 1 1 I I 

0 20 40 60 80 

Distance from home to two'/ear college 
(miles) 

Fig. 5 — Probability of enrollment versus distance to two-year 
college for a student living near a public university 



Even though the two-year institution stimulates additional enroll- 
ment only if it is within 20 miles of the student. It does Influence 
the distribution of enrollment between the two institutions at all dis- 
tances. In our simulation, we estimate that the likelihood of the stu- 
dent enrolling in a particular institution, if he enrolls, is given by 
the following equation: 



P(a) 




where a and a' are the two institutions. Using the estimated utilities 
of the two institutions (Table 7), we arrive at the division of the stu- 
dent's enrollment between the two institutions sho'rfn in Fig. 6. 

Another way of looking at the distribution of students is from the 
point of view of the university, which faces competition for students 




c 
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II 



0 20 40 60 80 

Distance from home to two-year college (miles) 

Fig .6 — Distribution of enrollment versus distonce 
to twcyear college 

from the proposed two-year college. The effect of the new institution 
on the enrollment of the existing university can be estimated using 
Fi^K. 4 and S. The estimate is sho%m in Fig. 7. 



u 
2 

2. 

C 

s t 

J§ 

1 

c 



0.50 



0.25 



■ Public university enrollment 
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Vlr. SOME THOUG HTS ON NEXT STEPS 

In Ihfir current state^ the utility of our models of college choice 
and college going in the pollcyiaaklng process is somewhat restricted* 
There is a need for both Improved specification and improved data in 
order to increase their utility* An Important limitation of our cur*» 
rent efforts is our inability to specify variables— especially financial 
aid offers — which are important (and* at this time, highly controversial) 
policy instruments* Improving the reliability of our financial aid estl* 
mates in order to examine the effect on student behavior will require 
better data and a better understanding of institutional behavior » the 
key determinant of financial aid offers* Without improved understanding 
of this determination process t the reliability and consequently the 
utility of our models will remain somewhat limited. 

Another important constraint on our model is the lack of currency 
in the data* Many observers have noted that student choice and college-^ 
going behaviors have changed dramatically in recent years* Whether 
these changes are the result of modified desires on the part of students 
or the result of an altered demographic distribution of high school grad- 
uates or of changes in the alternatives students face remains in doubt; 
butt without more recent data» our estimations remain less than thoroughly 
convincing in the current policy debate* 

There Is also a further need to investigate the similarity of stu- 
dent behavior among students from different states* Although our coef- 
ficients derived from an initial run with data on North Carolina students 
are essentially like those derived from data on Illinois^ they differ in 
some important respects* These differences limit the acceptability of 
using one state's coefficient to predict behavior for students in another 
state* The similarity does* however* lead us to believe that more care- 
ful specification may result in patterns of coefficients that are similar 
enough that one state's model will prove useful in predicting behavior 
in another state. 

Another important limitation of our model is our crude handling of 
noncollege alternatives* It is clear that the noncollege alternatives 



ot iMu* studt^nt will ru)t bv the .saoe ai; those of another* although ve 
Implli itly ippvar to assumt* that they are. The characteristics of non- 
mi U^^iato choices are also important aspects of the on-going policy 
dobati'« and onr model provides essentially no information for these 
iDusidorat ions. With the end of the military draft and the new eligi** 
hility tor tfdfral financial aid avrards of students in proprietary and 
puhlii ViU at iunal-terhnical schools* the availability and desirability 
ot nomoUegiate choices has probably changed dramatically. Our model 
must be refined to include the impact of these changes. 

Final ly* improvements will result from application of the model to 
the reaU rather than simulated% policy problems facing post secondary 
decisionmakers* An effort to achieve these improvements is currently 
under way as we introduce the model into the post secondary education 
policymaking process in Florida. This effort will involve gathering 
state data and building simulation routines that are specific to the 
staters higher education environment. 
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A551QTAXE^^ itlST pjF DATA AQl^W^KS 

The tol lowing annotatiuns describe the data sources we used for 
this study. 

The College Entrance Examination Board* Manual <^f Fr^cr.hman Claaa Pro^ 

This source describes the admission decisions of 419 colleges in 
relation to the ability of the applicantSt accepted and enrolled fresh- 
m€*n students. These data were the principal input into the admission 
prediction model. Published biennially* the book contains tables with 
data relevant to the analysis of the institutional decision process. 
Although the level of detail and arrangement of tabular material is not 
uniform for the set of colleges covered* the tables are organized under 
a small number of categories within which presentation is uniform. 

American Council on Education* Institutional Hnuiarnh h'lh^ 

This source provides data on institutional characteristics for 
2119 higher education institutions. The college information contained 
in this file (median student ability* educational expenditures per stu- 
dent* institutional affluence* tuition* financial aid outlays* insti- 
tutional (untrol and type* and range of academic fields) is a primary 
source of data on college attributes. 

As a source of consistent and accurate data* the ACE file has a 
number of disadvantages. First* while ACE data for 4-year colleges and 
universities deal with academic year 1966-67*^data for community colleges 
are for the previous academic year. Second* some data are presented in 
a form which renders them only marginally usable for our study. Third* 
the interpretation of a number of pieces of information is hampered by 
the vagueness of the questions asked* which results in a lack of con- 
sistency of the reported data for different colleges. 

I ns t i t^u t i ona 1_ Lo cation Data 

As a complement to the ACE Institutional Research File* we have 
developed a data file containing the latitude and longitude of iJmost 
every college and university. 
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S«:t>PK is .1 tcMupri^hiMisive survey of 1966 high srhool freshmen and 
seniors. Approxinuitely il^OOO students In earh grade level were sur- 
vived and li*sted. The students* parents were then surveyed^ and the 
senicirs whii rould he located in a college were subsequent ly followed* 
up om* year atttT high school graduation. All students except seniors 
were resurvoyed annually and one year after high school graduation if 
they were enrolled in a college. The students surveyed came frtim 305 
high schools in lour states-*Cal ifornia« Illinois^ Massachuset ts^ and 
North Carol ina. 

The SroPK data iui hide aptitude and achievement scores « parental 
income and education (from student and parent reports) « student's career 
plans« colli^ge enrtUlmt^nt, source of funds for college expenses (from 
student and parent reports)^ college residence type^ etc. 

Wf used thf survfv of 1966 seniors. This survey has the following 
response patti*rn: 



1966 high school seniors 
1966 parental responses 



13,000 
11,700 
17,200 
10,600 



1967 students attending college 
1967 college' respondents 



4% 
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